
# Read Me for Partisan Disparities in the Use of Science in Policy


## Code

The following scripts in the Code folder produce the analyses presented in the main text and supplementary information:

	1_OverallScienceCiteGrowth.R -- script produces the overall estimates of the increase in science citation in our data over time. 

	2_CommitteeDataAnalysis.R -- script produces estimates of the difference in citation rates to science by congressional committees. 

	3_ThinkTankDataAnalysis.R -- script produces estimates of the difference in citation rates to science by ideological think tanks. 

	4_PaperDifferencePlots.R -- script produces estimates of the differences between the science cited by the left and right, including the overlap analyses. 

	5_EmbeddingAnalysis.R -- script produces results using SPECTER embeddings to visualize cited science and analyse clusters of cited science across committees and issues.

	6_MatchingAnalysis.R -- script produces results based on matched pairs of co-partisan and out-partisan policy documents. 

	7_TrustAnalysis.R -- script produces results based on the Survey of Political Elites and Public Servants. 

	8_WitnessAnalysis.R -- script produces supplementary analysis of witnesses called by congressional committees using data from Ban, Park, and You (2023).

	9_GrandstandingAnalysis.R -- script produces supplementary analysis of the relationship between grandstanding and citation to science using data from Park (2021).
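
The scripts above carry numeric prefixes, so one way to run them is in that order. The following is a minimal sketch, not part of the replication package: `list_analysis_scripts()` and `run_analysis_scripts()` are hypothetical helpers, the folder name `Code` and the working directory (the package root) are assumptions, and the sketch assumes each script can run on its own without objects created by an earlier script.

```r
# Hypothetical helper: list the numbered analysis scripts in run order.
# Assumes the scripts sit in a folder named "Code" (an assumption).
list_analysis_scripts <- function(code_dir = "Code") {
  files <- list.files(code_dir, pattern = "^[0-9]+_.*\\.R$", full.names = TRUE)
  # Order by the numeric prefix rather than alphabetically,
  # so a prefix such as 10_ would sort after 9_.
  prefix <- as.integer(sub("_.*$", "", basename(files)))
  files[order(prefix)]
}

# Hypothetical helper: source each script in turn.
# Each script is evaluated in a fresh environment to limit side effects;
# this only works if the scripts do not rely on each other's objects.
run_analysis_scripts <- function(code_dir = "Code") {
  for (script in list_analysis_scripts(code_dir)) {
    message("Running ", basename(script))
    source(script, local = new.env())
  }
  invisible(NULL)
}
```

Calling `run_analysis_scripts()` from the package root would then source every matching script. If a script sets its own working directory or data paths, those would need to be checked first.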


## Data

The following files include data used by the analysis scripts above:

	CleanedDeIdentifiedSPEPS02022021_Science.csv -- survey data from SPEPS

	Committee_documents_FOR.RData -- committee documents by field of research of cited science. List with an entry for each FOR.

	Committee_Party_Citation_Clusters.RData -- committee cited science labelled by the party of the citing committee with cluster assignments based on their SPECTER embeddings.  List with an entry for each committee. 

	CongressDocumentClassifications.csv -- Overton issue classifications for committee documents

	CongressDocumentsAnalysisData.csv -- Overton committee citation data aggregated to the document level.

	CongressDocumentsAnalysisSupplementaryData.csv -- Overton committee citation data aggregated to the document level, merged with the Ban, Park, and You (2023) data.

	CongressDocumentsCitations.csv -- Overton data on committee documents with their citations via DOI. 

	CongressMatchedPairCitationSimilarities.csv -- Matched pairs of congressional documents with their pairwise citation similarities. 

	CongressPaperFOR.csv -- Field of Research for the papers cited by Congress. 

	CongTTDOI.csv -- SPECTER paper id to DOI crosswalk. 

	CongTTPaperEmbeddings3272023.csv -- SPECTER embeddings for all Congress and think tank cited papers

	dfp_furnas_micro - dfp_furnas_micro.csv -- public opinion survey data fielded in tandem with SPEPS. 

	Dimensions-Fields-of-Research-2020-10-28_14-28-57.csv -- Dimensions Field of Research names crosswalk. 

	DtoDCommitteeMatches3.csv -- matched pairs of Dem to Dem committee documents.

	DtoRCommitteeMatches3.csv -- matched pairs of Dem to Rep committee documents. 

	hearings_apsr.dta -- Ban, Park, and You (2023) replication data.

	LtoLThinkTankMatches3.csv -- matched pairs of Left to Left think tank documents. 

	LtoRThinkTankMatches3.csv -- matched pairs of Left to Right think tank documents. 

	OvertonCitedPapers.csv -- Dimensions paper data for the science in the Overton data. 

	OvertonCommitteeNames_Standard.csv -- Overton committee name crosswalk to standardize. 

	ParkGrandstandingMatchedData.csv -- Committee hearings data from Overton matched with the Park (2021) grandstanding score. 

	RtoDCommitteeMatches3.csv -- matched pairs of Rep to Dem committee documents. 

	RtoLThinkTankMatches3.csv -- matched pairs of Right to Left think tank documents. 

	RtoRCommitteeMatches3.csv -- matched pairs of Rep to Rep committee documents. 

	RtoRThinkTankMatches3.csv -- matched pairs of Right to Right think tank documents. 

	Think Tank EINs.csv -- codings for the ideology of think tanks.

	ThinkTank_documents_FOR.RData -- think tank documents by field of research of cited science. List with an entry for each FOR.

	ThinkTankClassification_Citation_Clusters.RData -- think tank cited science labelled by the ideology of the citing think tank with cluster assignments based on their SPECTER embeddings. List with an entry for each think tank issue. 

	ThinkTankDocumentClassifications.csv -- Overton issue classifications for think tank documents. 

	ThinkTankDocumentsAnalysisData.csv -- Overton think tank citation data aggregated to the document level.

	ThinkTankDocumentsCitations.csv -- Overton data on think tank documents with their citations via DOI. 

	ThinkTankMatchedPairCitationSimilarities.csv -- Matched pairs of think tank documents with their pairwise citation similarities. 

	ThinkTankPaperFOR.csv -- Field of Research classifications for think tank cited papers. 

	TopMatchedThinkTankEmbeddingsForClustering.csv -- Embeddings for science from the highly similar matched think tank documents for clustering. 

	USAPolicyDocumentsForAggFiltered.csv -- Larger set of Overton policy documents for use in Script 1. 

	VoteUnityScores.csv -- Party Vote Distinctiveness scores from Crosson, Furnas, and Lorenz (n.d.).

	ZetaBasedPoliticizationScores.csv -- Party Speech Distinctiveness scores from Crosson, Furnas, and Lorenz (n.d.).
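
Several of the files above are `.RData` files described as lists, but this README does not name the R objects stored inside them. The sketch below shows one way to inspect such a file without overwriting anything in the global environment. `load_rdata()` is a hypothetical helper, and the `Data/` folder in the commented usage is an assumption about where the files are stored.

```r
# Hypothetical helper: load an .RData file into a fresh environment and
# return its contents as a named list, one element per stored object.
load_rdata <- function(path) {
  env <- new.env()
  # load() returns a character vector with the names of the restored objects.
  loaded <- load(path, envir = env)
  mget(loaded, envir = env)
}

# Usage (path is an assumption; adjust to where the data files live):
# objs <- load_rdata("Data/Committee_documents_FOR.RData")
# names(objs)        # names of the objects stored in the file
# str(objs[[1]], 1)  # top-level structure of the first object
```

Returning the objects as a list keeps the stored names visible, which is useful when a file holds a list with one entry per field of research or per committee.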


## Computing environment

Code was run on a MacBook Pro running macOS Sequoia 15.3 using R version 4.4.1 (2024-06-14) -- "Race for Your Life".

## Dependencies

gtsummary       jtools          dplyr           tidyr           tibble         
plotly          MASS            rstatix         ggpubr          Matrix         
base            stats           sjPlot          cowplot         fixest         
ggeffects       ggplot2         lme4            modelsummary    purrr          
readr           vtable          broom           marginaleffects graphics       
DIDmultiplegt   fect            forcats         grDevices      
panelView       stringr         sjstats         congressTools   ggdist         
lubridate       vroom           proxy           cramer          htmlwidgets    
randomcoloR     Rtsne           StatMatch       utils           boot           
ClusterR        ggrepel         questionr       Hmisc           eRm            
survey          weights         readstata13
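
To check which of these packages are missing from a local R library, something like the following could be used. This is a sketch rather than the authors' setup procedure: `deps` simply restates the list above, and `missing_packages()` is a hypothetical helper. Packages such as `base`, `stats`, `graphics`, `grDevices`, and `utils` ship with R and need no installation. Some of the other packages (for example `congressTools`) may not be available from CRAN and may need to be installed from another source.

```r
# Package names copied from the dependency list above.
deps <- c(
  "gtsummary", "jtools", "dplyr", "tidyr", "tibble",
  "plotly", "MASS", "rstatix", "ggpubr", "Matrix",
  "base", "stats", "sjPlot", "cowplot", "fixest",
  "ggeffects", "ggplot2", "lme4", "modelsummary", "purrr",
  "readr", "vtable", "broom", "marginaleffects", "graphics",
  "DIDmultiplegt", "fect", "forcats", "grDevices",
  "panelView", "stringr", "sjstats", "congressTools", "ggdist",
  "lubridate", "vroom", "proxy", "cramer", "htmlwidgets",
  "randomcoloR", "Rtsne", "StatMatch", "utils", "boot",
  "ClusterR", "ggrepel", "questionr", "Hmisc", "eRm",
  "survey", "weights", "readstata13"
)

# Hypothetical helper: return the packages in `pkgs` that are not installed.
missing_packages <- function(pkgs, installed = rownames(installed.packages())) {
  setdiff(pkgs, installed)
}

to_install <- missing_packages(deps)
print(to_install)

# Uncomment to install the missing packages that are available from CRAN:
# install.packages(to_install)
```

Package versions are not recorded in this list, so results may differ slightly under newer releases than those used with R 4.4.1.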